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ABSTRACT 

The XBootes Survey is a 5-ks Chandra survey of the Bootes Field of the NOAO Deep Wide-Field Survey 
(NDWFS). This survey is unique in that it is the largest (9.3 deg^), contiguous region imaged in X-ray with 
complementary deep optical and near-IR observations. We present a catalog of the optical counterparts to 
the 3,213 X-ray point sources detected in the XBootes survey. Using a Bayesian identification scheme, we 
successfully identified optical counterparts for 98% of the X-ray point sources. The optical colors suggest that 
the optically detected galaxies are a combination of z < 1 massive early-type galaxies and bluer star-forming 
galaxies whose optical AGN emission is faint or obscured, whereas the majority of the optically detected point 
sources are likely quasars over a large redshift range. Our large area. X-ray bright, optically deep survey enables 
us to select a large sub-sample of sources (773) with high X-ray to optical flux ratios (U/U >10). These objects 
are likely high redshift and/or dust obscured AGN. These sources have generally harder X-ray spectra than 
sources with 0.1 <fv/fo <10. Of the 73 X-ray sources with no optical counterpart in the NDWFS catalog, 47 are 
truly optically blank down to R ~25.5 (the average 50% completeness limit of the NDWFS R-band catalogs). 
These sources are also likely to be high redshift and/or dust obscured AGN. 

Subject headings: X-ray Survey, AGN, Cosmology 


1. INTRODUCTION 

Active Galactic Nuclei (AGN) are complex objects which 
radiate across the entire electromagnetic spectrum. To gain 
a better insight into their nature and how they evolve, we re¬ 
quire large samples of AGN in multiple wave-bands. Previ¬ 
ous studies at optical and soft X-ray wavelengths have failed 
to obtain a complete census of the AGN population because 
dust and gas obscures many AGN from view. In fact in the 
local Universe, optically obscured AGN m ay outnumber opti¬ 
cally unobscured AGN by a factor of four JMaiolino & Riek^ 
Il995h . In all but the shallowest surveys, the number density 
of AGN identified in the hard X-ray and mid-IR bands is far 
greater than is found in optical surveys of comparable dept h 
iBauer et ^12004 iRisaliti & Elvis ll2()()4t IStern et alll20()5ll . 

Thus it is important to determine the extent to which there 
exists a hidden population of AGN whose optical properties 
do not identify them as AGN. Because of the ability of hard 
X-rays to penetrate dust and all but the most extreme column 
densities of gas. X-ray surveys with sensitivities above ~4 
keV can provide relatively unbiased samples of AGN over a 
range of redshifts. 

We summarise some of the exis ting extragalactic X-ray sur¬ 
veys in Table [p The Chandra l^eisskopf et al.ll2002i) and 
XMM-Newton dJansen et alJl2001h observatories have led to 
great advances in X-ray astronomy, producing surveys which 
are 10 - 100 times deeper than those by previous X-ray tele¬ 
scopes. Chandra's imaging resol ution is superior to all previ- 
ous and current X-ray telescopes ilvan Sneybroeck et~aT 
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iGarmire et al.ll2003ll : this is crucial for determining the cor¬ 
rect optical counterpart in deep optical imaging data in which 
the surface density of sources is large. While the deepest sur¬ 
veys have resolved ^80% of the hard (2-7 keV) X-ray back¬ 
ground into discrete sourc es dBrandt et alJl2Q()U iRosati et all 
EM iMoretti et eljl2(M iWorslev et alJl2n04ll and an even 
larger fraction of the soft (0.5-2 keV) X-ray background, they 
typically cover only small (~0.1 deg^) areas of the sky. Sur¬ 
veys covering a larger volume are needed to overcome cosmic 
variance and to better determine the properties of the most lu¬ 
minous and rarest sources such as powerful quasars, whose 
number densities are low. Large contiguous areas and ex¬ 
tensive multi-wavelength coverage are also necessary for de¬ 
tailed studies of AGN clustering and environment. 

Here we identify the candidate optical counterparts to the 
X-ray point sources in a new, medium depth (5-ks/pointing), 
wide-held X-ray survey performed with Chandra ACIS-I 
(XBootes; Murray et al. 2005) of the Boo tes held of the 
NOAO Deep Wide-Field Survey (NDWFS; l.lanniizi & Devi 
IT^ Jannuzi et al. in prep.; Dey et al. in prep.). The 
XBootes survey is unique because of its contiguous imag¬ 
ing coverage over a large (9.3 deg^) area with Chandra. 
While not as deep as many past surveys made with Chan¬ 
dra and XMM, the large co-moving volume surveyed al¬ 
lows us to obtain large samples of sources comprising the 
bright end of the X-ray luminosity function and to determine 
the nature of rare populations whose number densities are 
too small to obtain meaningful statistics in small area sur¬ 
veys. The large contiguous area is also critical for studies 
of AGN clustering: one of the key goals of the XBootes 
survey. We can also perform stacking analyzes of the X- 
ray data to determine the mean X-ray properties of different 
popul ations selected in different wavebands (e.g.. lBrand et all 
120051 Watson et al. in prep.). The Bootes held is unique 
in the extent of its multi-wavelength coverage over such a 
large area. The mul ti-wavelength d ata includes X-ray (Chan¬ 
dra), UV (GA LEX: iHoopesIpOOl . optical (NDWFS), near- 
IR (FLAMEX; lElston et alJUOOai . mid-IR (Spitzer/IRAC - 
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TABLE 1 

Properties oe recent extragalactic deep and medium-deep X-ray surveys ordered by increasing survey area. The 
X-RAY ELUX is QUOTED FOR THE 0.5-2 KEV BAND AND HAS BEEN CONVERTED TO THE ELUX ASSUMING A POWER-LAW WITH r=1.7 
WHERE NEEDED. THE OPTICAL MAGNITUDE LIMIT IS TAKEN FROM THE REEERENCED PAPER AND MAY BE DEEINED IN DIEEERENT WAYS. 

a. The LALAeield is in the north-east corner oe the Bootes eield. b. The number in parenthesis indictates the total 

EXPECTED AREA OE THE SURVEY WHEN COMPLETED. 


Survey 

Area 

deg2 

X-ray Flux limit 
ergs cm“^ 

R limit 
magnitudes 

Reference 

Chandra Deep Field North 

0.1 

2.4x10-*'' 

~26.4 

Barser et al. 120031. Alexander et al. (200^1 

Chandra Deep Field South 

0.1 

5.7x10-*'' 

~26.5 

Giaccon^^^2002) 

Extended Chandra Deep Field South 

0.3 

8x10-*' 

~26.5 

Lehme^^^2005) 

Chandra Extended Growth Ship (DEEP) 

0.1 

LOxlO-*** 

23.4 

N andra^^^_^2005 1 

Chandra LALA^ 

0.1 

1.4x10-'** 

25.7 

Wane et al. 120041 

SPICES-II 

0.1 

1.8x10-'** 

25.4 

Stem e^^2002) 

XMM Lockman hole 

0.25 

3.0x10-'** 

~27.1 

Hasinser et al. f2001).Wilson et a|^2004^ 

CLASXS 

0.36 

5.0x10-'** 

27.0 

Yang e^^^20041 

Chandra suiwey of 13h XMM/ROSAT 

0.25 

5.0x10-'** 

27.0 

McHard^^^^20031 

Chandra Multi-wavelength Project 

0.8 (14.0'’) 

variable 

24.0 

Kim et al. 120041. Green et al. (20041 

ROSAT Ultra Deep 

0.36 

1.1x10-*^ 

25.5 

Lehmani^^^20(^) 

XBootes 

9.3 

4.0 X 10 

25.5 

Murray et al. (2005), Kenter et al. (2005); this work 

XMM-Newton/2dF 

1.5 

4.0x10-'^ 

19.5 

Georuakakis et ^^2003) 

HELLAS2XMM 

0.9 (4.0'') 

7.5x10-*^ 

24.5 

Baldi et al. 120021 


lEisenhardt et al.l l2004t Spitzer/MlPS - ISoifer et al.l l2004l) . 
and ra dio (VLA/FIRST; fBecker. White. & Helfandll 19951 and 
WSRT: ld^ries et alJ2003r Optical spectroscopic follow-up 
observations have also been undertaken for all X-ray sources 
with I <21.5 as part of the AGES survey (Kochanek et al. in 
prep.). 

In Murray et al. (2005; Paper I), we described the general 
characteristics of the X-ray survey and determined the angu¬ 
lar clustering of the sources. In Kenter et al. (2005; Paper II) 
we presented the X-ray catalog. The main aim of this paper 
is to provide an accurate catalog of the optical counterparts 
of the X-ray point sources in the Bootes region which were 
presented in Kenter et al. (2005). We begin with a summary 
of the X-ray, optical, and near-IR data used in the XBootes 
and NDWFS surveys (Section 2). We discuss our method of 
associating the X-ray source with optical counterparts in Sec¬ 
tion 3. The robustness of the resulting matched catalog as well 
as the properties of the catalog are described in Section 4. In 
Section 5, we describe the X-ray and optical properties of the 
matched catalog. The main points of the paper are summa¬ 
rized in Section 6. 


2. THE DATA 

2.1. X-ray Imaging: the XBootes Survey 

The details of the X-ray observations for the XBootes Sur¬ 
vey and the resulting XBootes source catalog have been pre¬ 
sented in Murray et al. (2005) and Kenter et al. (2005). In this 
Section, we summarize the main survey characteristics. 

A 9.3 deg^ region of sky chosen to match the area covered 
with NDWFS was observed by the Advanced CCD Imaging 
Spectrometer (ACIS-I) on the Chandra X-ray Observatory 
over a 2 week time interval in March and April 2003. The 
data was taken in 126 separate pointings, each observed for 
« 5-k s. The CIAO 3.0.2 w avelet detect algorithm (WAVDE- 
TECTi lFVeeman et al.l2002h was used to detect X-ray sources 
in the total (0.5-7.0 keV) band data. A probability thresh¬ 
old of 5 X 10 ^ was chosen as the best compromise between 
maximizing the completeness while minimizing the number 
of spurious detections. Although the WAVDETECT algorithm 
provides a robust method of identifying real X-ray sources. 


it performs less well at providing an accurate source posi¬ 
tion and its uncertainties. The positions were instead esti¬ 
mated by iteratively centroiding the counts within the 50% 
encircled-energy radius of the Chandra ACIS point spread 
function. The total counts from each X-ray source were de¬ 
termined within the 90% encircled energy radius (see Murray 
et al. (2005) for details). No corrections were made for the re¬ 
maining flux expected to fall outside this radius. The counts in 
the soft (0.5-2.0 keV) and hard (2.0-7.0 keV) bands were de¬ 
termined in the soft and hard-band images using the position 
and 90% encircled energy radius from the total band image. 
The X-ray catalog comprises 3,293 unique X-ray sources with 
>4 counts in the total band images (Kenter et al. 2005). We 
expect only ^35 of these sources to be spurious in the full sur¬ 
vey (Kenter et al. 2005). For the matching with cataloged op¬ 
tical counterparts, we only considered the 3,213 X-ray sources 
which overlap with the NDWFS area (see Figure^. 

To estimate the X-ray flux for each source, a local back¬ 
ground was calculated and subtracted from the net counts. 
Telescope vignetting results in an effective exposure time that 
varies with X-ray energy and position on the image (drop¬ 
ping to ^20% near the edges of the field). To account for 
this, we multiply the background-subtracted counts by the ra¬ 
tio of the average exposure time (5-ks) to the effective ex¬ 
posure time at the source position. We also divide by the 
fractional effective area at the source position. Assuming a 
power-law spectrum with photon index r=1.7 and the Gal ac- 
tic HI column density of 1 x 10^° cm“^ dStark et al.lll992^ ■ a 
4 count source in our survey corresponds to 7.8x10“^^ ergs 
cm“^ s“\ 4.7 X 10“'^ ergs cm“^ s“', and 1.5 x 10“'"^ ergs cm“^ 
s“^ in the 0.5-7 keV, 0.5-2 keV and 2-7 keV bands respec¬ 
tively. This corresponds to a total band X-ray luminosity of 
F;c = 2 X 10"^'ergs s“', 4 x 10"^^ergs s“', and 6 x 10"^ergs s“\ 
atz=0.1, 1, and 3 respectively. 

2.2. Optical and Near Infra-red Imaging: the NDWFS 

The NOAO Deep Wide-Field Survey (NDWFS) comprises 
deep optical and near infra-red imaging in two ^9 deg^ re¬ 
gions of the sky in Bootes and Cetus, and is designed to 
study the formation and evolution of large-scale structure 
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llJannuzi & iBrown et al.ll2003l) . The optical and 

near-IR data for the Bootes field were obtained using the 
4m and 2.1m telescopes of the Kitt Peak National Obser¬ 
vatory between February 1998 and April 2004. In this pa¬ 
per, we consider Bw,R, and /-band imaging over almost the 
entire 9.3 deg^ of the NDWFS Bootes field (the field cov¬ 
ered by our X-ray imaging; Jannuzi et al. in prep.), and 
X-band imaging over approximately 6 deg^ of the Bootes 
field (Dey et al. in prep.). X-band imaging has been ob¬ 
tained for the entire Bootes field, but only 6 deg^ were avail¬ 
able at the time of this work. We use the NDWFS data re¬ 
lease version 3.0 which can be obtained through the NOAO 
science archive (http;//archive.noao.edu/nsa/i and NDWFS 
homepage (http;//www.noao.edu/noao/noaodeepi. We deter¬ 
mined the 50% completeness limits as a function of magni¬ 
tude by adding artificial stellar objects to copies of the data 
and recovering them with SExtractor. Details of the complete¬ 
ness as a function of NDWFS sub-field can be found in Jan¬ 
nuzi et al. (in prep.) and at the NDWFS homepage. The av¬ 
erage 50% completeness limits of the Bw, R, /, and X-bands 
images are 26.7, 25.5, 24.9, and 18.6 respectively. We use 
Vega magnitudes throughout. 

The opti cal catalogs were gener ated using SExtractor, ver¬ 
sion 2.3.2 (iBertin & ArnoutsI 19961) run in single-image mode 
with the minimum detection area, convolution filter, and sig¬ 
nal above sky threshold optimized to provide a deep yet reli¬ 
able catalog. The astrometry for the ND WES was calibrate d 
using matches to the USNO-A2 catalog (iMonet et al.lll991^ . 
resulting in an uncertainty in the absolute positions of the 
sources of < 0"1 (Jannuzi et al. in prep., Dey et al. in prep.). 
Detections in two separate optical bands were matched if the 
centroid of a detection in one band fell within the image el¬ 
lipse calculated from a detection in the other band or if the 
centroids of the detections were within 1" of each other. In 
the case of a detection in the /?-band, we used this to assign 
the position of the matched catalog; otherwise we used the 
position from either the Bw or /-band catalogs in that order 
of preference. We will generally use this ordering wherever 
we must assign a optical parameter to a source. Because the 
depth of the X-band data is not comparable to the optical data 
and may contain a higher number of spurious sources, we re¬ 
quired there to be a match in at least one of the optical bands 
in addition to the X-band detection for a source to be included 
in the single matched catalog. 

3. MATCHING THE X-RAY AND OPTICAL SURVEYS 

This paper is concerned with the optical counterparts of X- 
ray point sources in the NDWES Bootes field. It comprises 
27 optical imaging sub-fields (each roughly corresponding to 
the area of a single pointing of the KPNO MOSAIC-1 prime- 
focus imager) and 126 pointings of the Chandra ACIS-I cam¬ 
era. The positions and overlap of the sub-fields on the sky can 
be seen in Eigure[2 Over-plotted are the 3,293 X-ray point 
sources with > 4 counts. Although the surveys are extremely 
well matched in their spatial coverage, the difference in the 
size and shape of the individual optical and X-ray pointings 
led to small differences in their coverage. The X-ray sources 
represented by light dots are those that fall off the edge of the 
NDWES survey; these sources are not included in any further 
analysis. The total number of X-ray sources considered in this 
paper is therefore 3,213. 

3.1. Relative Positional Ojfsets between X-ray and NDWFS 
Imaging Frames 


Any systematic offsets between the astrometric reference 
frames used for the X-ray and optical imaging or errors in the 
astrometric zero-point of any of the individual optical sub¬ 
fields or X-ray pointings could adversely affect our proce¬ 
dure for matching sources. The NDWES images were placed 
on an astrometric frame defined by the fainter stars in the 
USNOA2.0 (The ACT astrometric frame: iMonet et aOl998t 
see Jannuzi et al. in prep, for details of NDWES astrom¬ 
etry). The X-ray astrometry is based on the AGASC cat¬ 
alog (ISchmidt & Greenll2003h and is referenced to the Ty- 
cho2 astronomic frame, which incorporates and supercedes 
the ACT astrometric frame. Although the X-ray and opti¬ 
cal data should be on the same astrometric frame, we need 
to check and correct for any systematic offsets that may be 
present. Because of the large number of X -ray sources with 
cataloged optical counterparts (see Section IT^ . we can use 
the data to determine and correct any systematic offset. 

We calculated the median offset between the positions of 
all 3213 X-ray sources and the positions of all matched opti¬ 
cal sources in the survey. Eor this process, we used a simple 
scheme in which we take the nearest optical source within ei¬ 
ther 1.5" or the 50% X-ray positional uncertainty to be the 
matched optical counterpart. The median offset for sources in 
all the X-ray pointings was 0."03±0"02 east and 0"53±0"02 
north of the optical positions. 

We determined whether there were any field-to-field varia¬ 
tions in the astrometric zero-point by calculating the X-ray to 
optical positional offset for the sources detected within each 
X-ray pointing and for the sources detected within each opti¬ 
cal sub-field. To identify this systematic offset we determined 
the median offset of all X-ray sources with a positional un¬ 
certainty of less than 1" in a given Chandra/ACIS pointing. 
By correcting the positional offset of each X-ray pointing in¬ 
dividually, we are correcting for any positional uncertainties 
between the individual pointings as well as any astrometric 
zero-point error between the X-ray and optical surveys. In 
Eigure|2 we show the median X-ray to optical astrometric off¬ 
set and its corresponding error for X-ray sources with optical 
counterparts in each of the 27 optical sub-fields and in each of 
the 126 X-ray pointings. Because each optical sub-field con¬ 
tains approximately 4 X-ray pointings, the optical sub-fields 
will have astrometric offsets correlated with that of the X-ray 
pointings but will have smaller uncertainties due to the larger 
number of sources in each field. We therefore assumed no 
field-to-field variation in the astrometric offset of each opti¬ 
cal sub-field and only corrected for any field-to-field variation 
in the astrometric offset of each X-ray pointing. In practice, 
we only adjusted the X-ray source positions of the sources if 
there were 15 or more X-ray sources with a positional uncer¬ 
tainty of less than 1" (corresponding to sources with high X- 
ray counts and/or sources near the center of the X-ray point¬ 
ing). In all pointings with less than 15 such sources, we used 
the median offset of sources in the full sample. The largest 
correction we applied in any sub-field was ^0."9 although in 
most cases, it was ~0"5. The remaining astrometric offset 
and error are small (< 0"3 in each optical sub-field) and are 
shown in Eigure|3 In Table 2, we include the corrected X-ray 
source positions. The optical positions remain unchanged. 

Objects brighter than /?«17 are saturated in the typical ND¬ 
WES imaging exposures. This can result in large uncertain¬ 
ties in the measured positions and optical magnitudes of these 
sources. To correct the positions in the small number of af¬ 
fected sources, we matched the X-ray catalog to objects in 
the 2MASS catalog which has more accurate astrometry than 
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Fig. 1.— The spatial distribution of X-ray sources (denoted by black circles). Also plotted are the positions of the 126 X-ray pointings of the Chandra 
ACIS camera (diamond shaped boxes) and the 27 optical imaging sub-fields (i?-band images; lai'ge shaded area). X-ray sources whose positions are outside the 
boundaries of the NDWFS Bootes field are denoted with lightly shaded circles and are excluded from the analysis of this paper. 


Before astrometry correction 




Median RA Offset (arcsec) Median RA Offset (arcsec) 


Fig. 2.— The median X-ray to optical astrometric offsets and their corresponding errors for X-ray sources with optical counterparts in each of the 27 optical 
sub-fields (left) and each of the 126 X-ray pointings (right). The 5 X-ray pointings containing less than 15 matched sources are represented by dashed error bars. 
In these cases, we corrected the source positions using the median offset found for all the X-ray pointings. 
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shallow optical surveys such as the DSS. We applied the same 
astrometric offset that was used for the X-ray to NDWFS ref¬ 
erence frames as well as an additional known offset of 0"377 
east and 0"147 north to shift the 2MASS coordinates to that 
of the NDWFS. If the position of the 2MASS counterpart was 
within 10" of a bright (R <17) NDWFS counterpart, we used 
the corrected position of the 2MASS source for the nearest 
NDWFS source. 

3.2. Matching Optical Counterparts 

To identify the optical counterpart of each X-ray source, 
one must decide on the matching criteria to be adopted, i.e., at 
what angular separation can one be reasonably confident that 
an optical source is the true optical counterpart of an X-ray 
source. A compromise is required between having a complete 
sample and minimizing contamination from unrelated optical 
sources. Identifying the true optical counterpart is made eas¬ 
ier by the excellent positional accuracy of Chandra which has 
typical 90% uncertainties of Ri0."6 for a bright (> 10 count) 
on-axis source. However, the uncertainty in the X-ray posi¬ 
tion is a strong function of both the off-axis distance of the 
X-ray source from the X-ray pointing center and the strength 
of the X-ray source. The deep optical catalogs make it more 
likely that an X-ray source has a true optical counterpart but 
the high source density at faint optical apparent magnitudes 
also increases the risk of chance superpositions. 

To properly quantify these issues, we adopted a Bayesian 
identification scheme that assigns probabilities to optical 
sources of being the true optical counterpart rather than using 
a simple matching criteria. The Bayesian scheme is a more 
sophisticated approach than using a simple matching radius 
because it can self-consistently include additional information 
which is contained within the dataset itself. The basis of our 
method was to evaluate the probability of each of the opti¬ 
cal sources surrounding an X-ray source being more probable 
than a “background” source. Our method also supplies a for¬ 
mal estimate of there being no identifiable optical counterpart 
down to the optical limits of our survey. The Bayesian source 
identification method is described in AnnendixiAl 

4. RESULTS 

In this Section, we present the results of the Bayesian 
matching scheme and compare this to a simple matching algo¬ 
rithm. The results from Monte Carlo simulations are reported. 
We then present the optical catalog. 

4.1. Match Statistics 

4.1.1. Fraction of X-ray Point Sources with Optical 
Counterparts 

Of the 3,213 X-ray sources detected in the Bootes field, we 
found a likely optical counterpart for 3,140 (98%). Because 
all X-ray sources with near-IR identifications also have opti¬ 
cal counterparts, we hereafter refer to both the near-IR and 
optical counterparts in the NDWFS catalog as the “optical” 
counterparts. Some of the X-ray sources have more than one 
potential optical counterpart in our data set. For our analysis 
of the sample properties in the remainder of the paper, we al¬ 
ways used the most probable counterpart. For completeness, 
however, our catalog includes all candidate optical counter¬ 
parts with more than a 1% probability of being the matched 
source. 

In Figure |3 we show the fraction of X-ray sources with 
counterparts brighter than a given magnitude in each optical 


band. The total fraction of sources with an optical counterpart 
in each band is less than the total fraction in all bands because 
some sources are not detected in all the bands. The fraction of 
X-band matches has been corrected for the fact that the area 
of the X-band survey is smaller (only 59% of the sources that 
are surveyed in the other bands have coverage in the X-band). 
The number of optical counterparts is similar for each of the 
Bw,R, and I bands. However, only ~45% of X-ray sources 
have counterparts in the X-band, because the NDWFS X-band 
imaging is significantly shallower than the optical imaging for 
typical X-ray sources. We di scuss the optically non-detected 
X-ray sources in Section lsT^ 

4.1.2. Comparison with a Simple Matching Algorithm 

We have compared our Bayesian matching scheme with a 
simple matching scheme in which we define the most likely 
optical counterpart to be the closest optical match within ei¬ 
ther l."5 or the positional uncertainty of the X-ray source 
(Renter et al. 2005), whichever is greater. Using this sim¬ 
ple scheme, we find an optical counterpart for 2875 (89%) 
of the X-ray sources. When performing the simple matching 
scheme on a sample in which we shift the X-ray position by 
30" (i.e. a “random” sample), we find at least one optical 
counterpart for 38% of the sources. This high spurious iden¬ 
tification rate occurs because the NDWFS images are very 
deep and have a high number density of faint sources at any 
position. However, unlike the real data, the optical magnitude 
distribution in a random sample will be dominated by opti¬ 
cally faint sources. This method will therefore overestimate 
the number of spuri ous so urces. A more rigorous method is 
presented in Section l4.1.3l 

The Bayesian matching scheme picks the same optical 
counterpart as the simple matching scheme in 88% of cases. 
Of the 372 X-ray sources whose optical counterparts do not 
agree, 266 have an optical counterpart in the Bayesian match¬ 
ing scheme but no cataloged optical counterpart in the simple 
matching scheme; one has an optical counterpart in the sim¬ 
ple matching scheme but no cataloged optical counterpart in 
the Bayesian matching scheme; and 105 have different opti¬ 
cal counterparts. If we compare the optical counterpart ob¬ 
tained using the simple scheme to that of either the first or 
second most likely optical counterpart in the Bayesian match¬ 
ing scheme, only 3% of the matches disagree. 

We conclude that there is a reasonable agreement between 
the two matching schemes. The differences are primarily due 
to the fact that the Bayesian scheme is a more sophisticated al¬ 
gorithm, allowing for the inclusion of more information in the 
determination of a probable match. For example, the fact that 
many of the X-ray sources which have no cataloged optical 
counterpart in the simple scheme have larger positional off¬ 
sets between the X-ray and optical sources suggests that the 
positional uncertainty in the simple matching scheme may be 
underestimated. The X-ray positional error is defined as the 
50% enclosed energy radius divided by the square root of the 
number of counts (Renter et al. 2005). We do not expect that 
the true X-ray position will always fall within the formal po¬ 
sitional error and if we increased the matching radius to en¬ 
sure that we did, we would obtain unacceptable numbers of 
incorrect matches. In the Bayesian scheme, the positional un¬ 
certainty is determined self consistently from the X-ray and 
optical data themselves. In addition, the simple scheme picks 
only the nearest cataloged optical counterpart. If there is a 
brighter optical source (which has a lower probability of be¬ 
ing there by chance) slightly further away and the X-ray po- 
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Fig. 3.— The median X-ray to optical astrometric offsets and their con'esponding errors for X-ray sources with optical counterparts in each of the 27 optical 
sub-fields (left) and each of the 126 X-ray pointings (right) after the astrometric correction was applied. 



Fig. 4.— The fraction of X-ray sources with optical counterparts brighter 
than a given magnitude in the 5w-band (dashed line), i?-band (solid line), 
/-band (dotted line) and X-band (dot-dashed line). The mean 50% complete¬ 
ness magnitude limits are shown as vertical lines with the same linestyles 
as above used for each band. The AT-band completeness is corrected for its 
smaller survey area. 

sitional error is large, the Bayesian scheme may assign this to 
be the true optical counterpart. 

4.1.3. Monte Carlo Simulations 

To further test the reliability of our Bayesian technique, 
we ran Monte Carlo simulations of the optical and X-ray 
datasets to model the problem of matching the X-ray point 
sources from the XBootes Survey to the NDWFS optical cat¬ 
alog. The Monte Carlo results provide confidence in our over¬ 
all approach and a basis for interpreting the results for the 
real NDWFS/XBootes data. In particular, we use the Monte 
Carlo tests to understand the completeness of the identifica¬ 
tions (hereafter IDs). 

We modeled the X-ray positional uncertainties by a Gaus¬ 


sian (Eqn. with a constant, aa = 0."5, the PSF width at 
the pointing center, ctq = 0."5, and the quadratic growth of the 
PSF width as one moves off-axis, creoo = 4"0. We produced 
synthetic background catalogs by taking the XBootes catalog 
and generating new optical catalogs for the regions around 
each X-ray source. We used broken power-law models for the 
number counts, 

( 1 ) 

dm 

empirically normalized to match the observed number counts 
of the NDWFS field. The total number of sources was nor¬ 
malized to match the ~ 10^ observed sources in the matching 
regions used for the X-ray sources. The X-ray sources were 
modeled with three regions, a = 0.4 for 15 < m < 20, a = 0 
for 20 < m < 23, and a = -0.3 for 23 < m < 26, based on 
the magnitudes of the most probable optical IDs for the X-ray 
sources in the central dk < 400."0 regions of the ACIS fields. 
The background optical source positions were distributed ran¬ 
domly in the matching region for each X-ray source. To test 
our ability to recognize X-ray sources genuinely lacking op¬ 
tical counterparts, we assumed that a fraction (1-/) of the 
X-ray sources have no optical counterpart. For the sources 
with counterparts, the position of the counterpart was ran¬ 
domly drawn based on the “PSF model” using the observed 
X-ray counts and off-axis distance (Q and dk) for each X-ray 
source. 

Fig. 0 shows the results for estimating the fraction of 
sources with optical counterparts / with the PSF parame¬ 
ters fixed to their most likely values (era = ctq = 0."5 and 
CTgoo = 4"0). For the full range of / = 0.99 to / = 0.5, the 
posterior probability distribution for / tracks its true value, 
but is shifted to higher completeness by approximately a fac¬ 
tor of two. The origin of the shift can be traced to the unit 
ambiguities in the ratio Mjk/Bk (see Appendix) and the dif¬ 
ferent optical magnitude distributions of the background and 
X-ray sources. In experiments where we drew the magnitudes 
of the X-ray sources from the magnitude distribution of the 
background sources, the posteriori estimates for / were con¬ 
sistent with the input values. For present purposes, we should 
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Fig. 5.— The posterior probability for the fraction of X-ray sources lacking 
optical counterparts (1 —/) for Monte Carlo simulations in which 99%, 95%, 
90%, 70%, and 50% of the X-ray sources possess optical counterparts. The 
vertical dashed lines mai'k the input values, and in each case the posteriori 
estimates are shifted by roughly a factor of two towards higher completeness. 
If we draw the magnitudes of the X-ray sources from the magnitude distri¬ 
bution of the background sources we find no offset. These simulations were 
done with the “PSF” parameters fixed to their most likely values. 

regard / as a factor correcting the likelihood balance between 
ID and no ID for these ambiguities rather than as an absolute 
measurement of the completeness. In all experiments, how¬ 
ever, the posteriori estimate of / robustly tracked the input 
completeness. 

We now focus on the case with / = 95%, as it most closely 
matches the real data. Fig. shows that the probability that 
an optical identification is correct closely tracks the estimated 
probability {Pit) both for the field as a whole and in the cen¬ 
tral regions. Over the full field, the true optical source is the 
most probable match 92% of the time. It is the first or second 
most probable match over 98% of the time. As one would 
expect, most of the matching errors are in the outskirts of the 
X-ray ACIS pointings where the PSF is large or when the op¬ 
tical counterpart is faint. For the X-ray sources within 400" 
of the telescope axis, the true optical counterpart is the most 
probable match over 95% of the time, and it is the first or sec¬ 
ond most probable match over 99% of the time. Similarly, if 
the most probable optical counterpart has R< 22 mag, then it 
is the true counterpart 96% of the time. At least in our sim¬ 
ulations, misidentified X-ray sources are always matched to 
another optical source. There was no case of an X-ray source 
with a counterpart for which having no optical ID was the 
most probable result. Of the X-ray sources without optical 
counterparts, the most probable ID has no optical counterpart 
roughly 40% of the time and it is the first or second most 
probable ID 67% of the time. The percentages of correctly 
matched sources again improve considerably to 48% and 82% 
respectively in the inner parts of the ACIS pointing. These 
suffer from significant Poisson uncertainties (because there 
are only 73 sources without optical IDs) but are typical. Thus, 
assigning an optical counterpart to X-ray sources genuinely 
lacking an optical counterpart is a significantly greater prob¬ 
lem than the reverse. These results are generic and change 
little if we consider simulations with larger or smaller incom¬ 
pleteness. 



match probability 


Fig. 6.— The solid, dashed and dotted curves show the fraction of the 
time that the most likely, second most likely or “no ID” optical identification 
is correct as a function of their probability value for the Monte Carlo simula¬ 
tions with / = 95%. We show the results for both the full ACIS field (thick 
line) and the inner regions of the ACIS field (Jj < 40070; light line). We 
do not show the “no ID” results for the inner regions due to the poor number 
statistics. 

Fig.El shows the results for the PSF parameters and the 
completeness. The completeness estimate is / = 98% in¬ 
dependent of whether we use the logarithmic or CXO pri¬ 
ors. Given the bias we observed in the Monte Carlo simu¬ 
lations, this suggests that the real completeness is / ~ 95%. 
The systematic term in the PSF depends little on the priors 
((To = 0736 versus 0733 for the logarithmic and CXO priors). 
The other PSF parameters do show a dependence on the pri¬ 
ors, with CTo — 0."0 and aeoo = 5."2 for the logarithmic prior 
and (To = 0724 and a^oo = 478 with the CXO prior. However, 
despite the apparent parameter differences between the results 
for the two priors, the posterior probabilities for the identifi¬ 
cations are virtually identical. 

In summary, the Monte Carlo simulations provide confi¬ 
dence in applying our Bayesian matching scheme to the iden¬ 
tification of optical counterparts from the NDWFS to the X- 
ray sources in the XBootes Survey. By simulating the X-ray 
sources with true optical counterparts exhibiting similar prop¬ 
erties to that expected in the data itself, we demonstrate that 
the Bayesian method successfully recovers the true optical 
counterparts as the most probable counterpart 92% of the time 
(and the first or second most probable 98% of the time). The 
main errors occur in assigning the wrong optical counterparts 
or in assigning optical counterparts to sources with no true 
optical counterpart and this problem is likely to be generic to 
all matching methods. Although the measured completeness 
is «98%, the true completeness is likely to be f«95%. 

4.2. The Optical Catalog and Images of the X-ray Point 
Sources 

In Table 2 we present a sample of the matched op¬ 
tical and X-ray catalog for the Chandra sources in the 
Bootes field of the NDWFS. The complete version of 
this table is in the electronic edition of the Journal. The 
X-ray properties of each X-ray source are presented in 
Renter et al (2005) (the sources can be cross-matched 
using their X-ray names). The full catalog contain- 
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Fig. 7.— Posteriori estimates of parameters for the real NDWFS/XBootes 
data. The solid curves use the logarithmic priors and the dashed curves use 
the CXO priors. The hatched regions in the creoo panel show estimates of 
(T 600 ffoni the 50% (lower e dge) and 90% (upper edge) enclosed energy radii 
at 1.5 and 6.4 keV (see Ean. lAQI for parameter definitions). 

ing both the X-ray and optical parameters can be found at: 
ftp://archive.noao.edu/pub/catalogs/xbootes/xbootes_cat_xray 
_opt_lR_21jun_vl.0.txt and off the NDWFS homepage 
(http: //WWW. noao. edu/noao/noaodeep/1. 

The best matching criteria depends on the science to 
be undertaken and for this reason, we include in this data 
release all multiply matched sources (with >1% probability 
of being the correct optical counterpart) as well as infor¬ 
mation regarding their optical magnitude, distance to the 
X-ray source and probability of being the optical counterpart. 
Within our electronic table we include the following columns: 

(1) Name of X-ray source 

(2) X-ray Right Ascension 2000 shifted to NDWFS astromet¬ 
ric frame (hours) 

(3) X-ray Right Ascension 2000 shifted to NDWFS astromet¬ 
ric frame (minutes) 

(4) X-ray Right Ascension 2000 shifted to NDWFS astromet¬ 
ric frame (seconds) 

(5) X-ray Declination 2000 shifted to NDWFS astrometric 
frame (degrees) 

(6) X-ray Declination 2000 shifted to NDWFS astrometric 
frame (minutes) 

(7) X-ray Declination 2000 shifted to NDWFS astrometric 
frame (seconds) 

(8) X-ray Right Ascension 2000 shifted to NDWFS astromet¬ 
ric frame (total degrees) 

(9) X-ray Declination 2000 shifted to NDWFS astrometric 
frame (total degrees) 

(10) Name of X-ray pointing 

(11) Probability of X-ray source having an optical counterpart 

(12) Number of optical sources with >1% probability of 
being associated with the X-ray source 

(13) Optical rank (l=most probable in running order to least 
probable) 

(14) Bayesian probability of match 

(15) Identification flag (1 - optical ID; -1 - no optical ID. In 
cases of no optical ID being the most probable identihcation, 
the following flag values have been applied manually: -3 - no 


optical ID, but source obscured by nearby star / missing data; 
-2 - no optical ID, but X-ray position is close to optically 
bright galaxy (source is either obscured by or associated with 
the galaxy); -1 - ’true’ no optical ID (optical image is truly 
blank)). 

(16) NDWFS optical name (Null if no optical ID) 

(17) Optical (NDWFS) Right Ascension 2000 (hours; Null if 
no optical ID) 

(18) Optical (NDWFS) Right Ascension 2000 (minutes; Null 
if no optical ID) 

(19) Optical (NDWFS) Right Ascension 2000 (seconds; Null 
if no optical ID) 

(20) Optical (NDWFS) Declination 2000 (degrees; Null if no 
optical ID) 

(21) Optical (NDWFS) Declination 2000 (minutes; Null if no 
optical ID) 

(22) Optical (NDWFS) Declination 2000 (seconds; Null if no 
optical ID) 

(23) Optical (NDWFS) Right Ascension 2000 (total degrees; 
Null if no optical ID) 

(24) Optical (NDWFS) Declination 2000 (total degrees; Null 
if no optical ID) 

(25) Offset of Optical and X-ray positions (Null if no optical 
ID) 

(26) Bvr-band magnitude (Vega; Null: no optical ID, -1: no 
data in this band^, 0: no detection in this band (but detected 
in another band)) 

(27) R-band magnitude (Vega; Null if no optical ID, -1: no 
data in this band\ 0: no detection in this band (but detected 
in another band)) 

(28) /-band magnitude (Vega; Null if no optical ID, -1: no 
data in this band\ 0: no detection in this band (but detected 
in another band)) 

(29) X-band magnitude (Vega; Null if no optical ID, -1: no 
data in this band (generally because there is no X-band image 
coverage), 0: no detection in this band) 

(30) /?n^-band 50% completeness limit (-1: no data in this 
band') 

(31) /?-band 50% completeness limit (-1: no data in this 
band') 

(32) /-band 50% completeness limit (-1: no data in this 
band') 

(33) X-band 50% completeness limit (-1: no data in this 
band) 

(34) Optical Stellarity Parameter (taken from from either the 
R, Bw, or /-band catalogs in that order of preference; Null if 
no optical ID) 

(35) Name of optical sub-field in which source is found 

Examining the optical images of the X-ray sources is im¬ 
portant both in verifying that the matching criteria are sensi¬ 
ble and in determining the nature of the X-ray sources. We ex¬ 
tracted Bw, R, /,and X cut-out images for all the XBootes X- 
ray sources with unambiguous matches to check the matches 
by eye. In Figure |3 we show examples for a bright galaxy 
which appears to be either an interacting galaxy or a galaxy 
at the center of a cluster, an optical point source and an op¬ 
tically non-detected X-ray source. Optical cut-out images 
can be obtained using the NOAO cut-out service located at 
http://archive.noao.edu/ndwfs/cutout-form.html 

^ This may occur for a small number of sources in which the position on 
the optical image coincides with a masked out region due to a nearby bright 
star, cosmic ray, or other problem 














Table 2. Table showing the optical characteristic of the matched optical / X-ray catalog for the Chandra sources in 
the Bootes field of the NDWFS. The complete version of this table is in the electronic edition of the Journal. The 
printed edition contains only a sample. The X-ray properties are presented in Kenter et al (2005). 


(1) CXOName 
(16) NDWFSName 

(2,3,4) nxra 
(13) optrank 

(5,6,7) nxdec 
(14) bayprob 

(8) xnradeg 
(15) flag 

(9) xndecdeg 
(17,18,19) ora 

(10) CXOfield 
(20,21,22) odec 

(11) optprob 
(25) offset 

(12) nopt 
(26,30) Bw (lim) 

(27,31) R (lim) 

(28,32) I (lim) 

(29,33) K (lim) 


CXOXB J142433.0+342819 

14:24:33.02 

+34:28:19.75 

216.1375833 

34.4721528 

3652 

1.00 

1 




NDWFS J142433.0 1 342818 

1 

1.00 

1 

14:24:33.06 

+34:28:18.73 

1.13 

0.00(26.47) 

23.23(25.16) 

0.00(25.15) 

0.00(19.00) 

CXOXB J142434.0 1 331557 

14:24:34.04 

1 33:15:58.25 

216.1418333 

33.2661806 

4256 

1.00 

1 




NDWFS J142433.9+331558 

1 

1.00 

1 

14:24:33.98 

+33:15:58.77 

0.90 

19.95(26.51) 

19.63(25.65) 

18.95(24.77) 

-l.OO(-l.OO) 

CXOXB J142434.4+331340 

14:24:34.46 

+33:13:41.79 

216.1435833 

33.2282750 

4256 

1.00 

1 




NDWFS J142434.3+331343 

1 

1.00 

1 

14:24:34.38 

+33:13:43.16 

1.64 

20.33(26.51) 

19.79(25.65) 

19.34(24.77) 

-l.OO(-l.OO) 

CXOXB J142435.4+353855 

14:24:35.37 

+35:38:55.42 

216.1473750 

35.6487278 

3604 

1.00 

1 




NDWFS J142435.3+353856 

1 

0.98 

1 

14:24:35.32 

+35:38:56.26 

1.04 

23.39(26.69) 

21.04(25.62) 

19.81(24.69) 

18.00(19.20) 

CXOXB J142435.7+345012 

14:24:35.73 

+34:50:13.56 

216.1488750 

34.8371000 

3636 

1.00 

2 




NDWFS J142435.7+345013 

1 

0.88 

1 

14:24:35.70 

+34:50:13.90 

0.54 

0.00(26.41) 

0.00(25.26) 

24.38(24.73) 

0.00(19.20) 

NDWFS J142435.4+345012 

2 

0.10 

1 

14:24:35.49 

134:50:12.24 

3.33 

23.38(26.41) 

22.80(25.26) 

22.37(24.73) 

0.00(19.20) 

CXOXB J142435.7+354224 

14:24:35.74 

1 35:42:24.53 

216.1489167 

35.7068139 

3604 

1.00 

1 




NDWFS J142435.7+354224 

1 

1.00 

1 

14:24:35.74 

+35:42:24.00 

0.53 

21.54(26.69) 

20.60(25.62) 

19.96(24.69) 

18.18(19.20) 

CXOXB J142435.8+325131 

14:24:35.86 

+32:51:32.12 

216.1494167 

32.8589222 

4268 

1.00 

2 




NDWFS J142435.8+325132 

1 

0.59 

1 

14:24:35.88 

+32:51:32.84 

0.76 

0.00(26.84) 

23.13(25.82) 

22.77(25.17) 

-l.OO(-l.OO) 

NDWFS J142435.8+325130 

2 

0.40 

1 

14:24:35.84 

i 32:51:30.42 

1.71 

0.00(26.84) 

22.69(25.82) 

22.08(25.17) 

-l.OO(-l.OO) 

CXOXB J142437.64 331152 

14:24:37.63 

1 33:11:53.43 

216.1567917 

33.1981750 

4256 

1.00 

1 




NDWFS J142437.7+331153 

1 

0.99 

1 

14:24:37.72 

+33:11:53.21 

1.19 

20.56(26.51) 

20.31(25.65) 

19.98(24.77) 

-l.OO(-l.OO) 

CXOXB J142437.9+354404 

14:24:37.88 

+35:44:04.84 

216.1578333 

35.7346778 

3604 

1.00 

1 




NDWFS J142437.9+354404 

1 

1.00 

1 

14:24:37.95 

+35:44:04.45 

0.88 

25.67(26.69) 

22.08(25.62) 

20.86(24.69) 

17.75(19.20) 

CXOXB J142438.1+334245 

14:24:38.20 

+33:42:46.00 

216.1591667 

33.7127778 

4241 

0.63 

3 




NDWFS J142438.3+334241 

1 

0.59 

1 

14:24:38.36 

1 33:42:41.68 

4.76 

0.00(26.59) 

0.00(25.48) 

24.24(25.33) 

-l.OO(-l.OO) 

No Optical ID 

2 

0.37 

-1 




(26.59) 

(25.48) 

(25.33) 

(-1.00) 

NDWFS J142438.7+334247 

3 

0.03 

1 

14:24:38.70 

+33:42:47.00 

6.23 

0.00(26.59) 

0.00(25.48) 

24.77(25.33) 

-l.OO(-l.OO) 

CXOXB J142438.5+322649 

14:24:38.55 

+32:26:50.51 

216.1606250 

32.4473639 

4282 

1.00 

1 




NDWFS J142438.4+322651 

1 

1.00 

1 

14:24:38.47 

+32:26:51.09 

1.17 

21.49(26.84) 

20.10(25.82) 

19.46(25.17) 

-l.OO(-l.OO) 

CXOXB J142439.8+340757 

14:24:39.84 

+34:07:58.34 

216.1660000 

34.1328722 

4225 

1.00 

1 




NDWFS J142439.9+340759 

1 

1.00 

1 

14:24:39.94 

+34:07:59.89 

1.94 

22.75(26.47) 

21.38(25.16) 

20.51(25.15) 

17.99(19.00) 

CXOXB J142440.0+345137 

14:24:40.07 

+34:51:37.94 

216.1669583 

34.8605389 

3636 

0.98 

2 




NDWFS J142440.1+345136 

1 

0.98 

1 

14:24:40.19 

+34:51:36.21 

2.32 

23.73(26.41) 

22.67(25.26) 

22.01(24.73) 

0.00(19.20) 

No Optical ID 

2 

0.02 

-1 




(26.41) 

(25.26) 

(24.73) 

(19.20) 


VO 
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CXOXB J142651.0+322217 



CXOXB J143545.7+333901 



Fig. 8 .— Example optical finding charts for a bright galaxy possibly in an interacting system (top), an optical point source (middle) and an optically non- 
detected X-ray source (bottom). The finding charts are 30'' wide and centered on the X-ray position, with North up and East to the left. The X-ray positional 
uncertainties are lf'25, Of'86, and If'Ol from top to bottom. 
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5. THE X-RAY AND OPTICAL PROPERTIES OF THE MATCHED 
SAMPLE 

In this Section, we investigate the X-ray and optical prop¬ 
erties of the matched population. We use the optical mor¬ 
phology to determine the fraction of sources whose optical 
light is dominated by point-like emission (AGN and stars) 
and extended emission (galaxies whose AGN emission is ob¬ 
scured from view). By plotting their color-color distribution, 
we determine the approximate redshift range and galaxy type 
or amount of reddening of the AGN. We also investigate the 
properties of AGN whose optical emission is heavily atten¬ 
uated (dehned by sources with a high X-ray to optical flux 
ratio, which includes sources with no optical counterpart). 

5.1. Optical Morphology 

For our simple morphologic al analysis, we used the S ource- 
Extractor stellarity parameter (iBertin & Arnoutsll996h which 
has values between 0 (for sources with the most extended ra¬ 
dial profiles) and 1 (for point-like sources well characterised 
by their point spread functions). We made no correction 
for the variable seeing (0"77<FWHM<1"53 in the 27 R- 
band sub-fields). In Figure |3 we show the stellarity param¬ 
eter against the apparent /?-band magnitude for all optically 
matched X-ray sources. If we classify all sources with stel¬ 
larity > 0.7 as point-like and all sources with stellarity < 0.7 
as (extended) galaxies, there are 1279 optical point sources 
and 1861 galaxies. SExtractor may fail to correctly clas- 
sify sources fainte r than /?=23 in the NDWFS data (see, e.g., 
iBrown et al.l200.3l) and this is apparent in the spread of stellar¬ 
ity values at faint /?-band magnitudes in Figure If we only 
consider the sources with R <23 counterparts, which com¬ 
prise 76% of the sample, there are 1151 point sources and 
1304 galaxies. 

The point sources are likely to be those in which the opti¬ 
cal light is dominated by a central AGN, although the point 
sources with the highest optical magnitudes are likely to be 
X-ray bright stars (this is confirmed by optical spectroscopy; 
Kochanek et al. in prep.). The stellarity of many sources with 
R < \1 may be less than 1.0 due to saturation effects. The 
extended sources are those in which the optical light is domi¬ 
nated by the stars in the host galaxy. The vast majority of the 
sources must contain a powerful AGN due to their luminous 
X-ray emission and they are likely to be optical type II AGN 
in which the optical AGN emission is obscured from view. 
There is a small population of optically bright {R < 17), X- 
ray faint galaxies (stellarity «0); the X-ray emission in these 
objects is likely to be dominated by high-mass X-ray bina¬ 
ries (HMXBs) whose X-ray emis sion is correlated w ith the 
star formation rate of the galaxy (iGrimm et al.ll2003li . Even 
in very active star-forming galaxies (SFR« 100 M 0 yr“*), the 
integrated hard X-ray lum inosity from HMXB s should only 
be L:c « 5 X 10"^^ergs s“' (iGrimm et al.ll2003li . Because the 
XBootes survey is relatively X-ray shallow, such X-ray emis¬ 
sion should only be detectable within galaxies at z <0.15. 
Although the stellarity parameter becomes unreliable at faint 
optical magnitudes, there is a significant number of R <23 
sources with intermediate values of stellarity. These sources 
could be bright galaxies with a Seyfert-like central AGN. 

5.2. Color-Color distributions 

Color-color diagrams can be used to compare the color dis¬ 
tributions of the sample to that expected for model galaxy 
and AGN templates and hence to infer the general charac- 






12 14 16 18 20 22 24 26 

R magnitude _ 

Fig . 9.— The SourceExtractor stellarity parameter iBertin & Amout j 
fT^ plotted against the apparent i?-band magnitude for all optically 
matched X-ray sources. The size of both the dots and arrows are scaled 
relative to the X-ray flux; there is a strong correlati on b etween the optical 
magnitude and X-ray flux of the sources (see Sectionj^^J. A larger stellarity 
parameter is obtained for the more point-like objects in the optical image. 
Aitows denote objects with no /?-band detection. In these cases we plot the 
/?-band 50% completeness limit for the appropriate optical sub-field; the stel¬ 
larity parameter has been measured from another band in which a detection 
is measured. The dotted line shows the boundary that we have assumed in 
dividing our sample into point sources (stellarity>0.7) and extended sources 
(stellarity<0.7). 

teristics of a population in terms of spectral type and red- 
shift. In Figure cni we show the Bw -R versus R-I color- 
color diagrams (Vega magnitudes are used throughout). We 
divided the sample into optically extended sources (stellar¬ 
ity < 0.7) and optical point sources (stellarity > 0.7) and 
into optically bright (17.0< R <21.5) and optically fainter 
(21.5< R <23.0) sources. We do not consider the optically 
faintest sources because the optical stellarity index is not well 
determined by our images in these cases. The optically bright 
sources classified as point sources and as extended sources 
generally occupy different regions of color-color space. For 
the extended sources, we overplot the predicted colors of 
Z=0-3 galaxies using PEGASE2 spectral synthesis models 
( iFioc & Rocca-Volmerangelll997h that assume exponentially 
declining star formation rates with e-folding times between 
t=0.7 and 15 Gyr (“r” models) and a formation redshift of 
Zf «4. For the point sources (unobscured AGN), we plot the 
predicted colors of z=0.1-3.25 quasars by redshifting a com¬ 
posite rest frame SDSS quasar spectrum (IVanden Berk et all 
1200 Ih . We account for the effects of absorption by interven¬ 
ing QSO ab sorption system s as a function of redshift using 
the model of iMadaul jl99-5ft . 

The majority of the optically bright (17.0< R <21.5) ex¬ 
tended sources fall on a well defined region of the color-color 
diagram consistent with them being moderate redshift (z < 1 ) 
galaxies with a range of t values (low r values are consis¬ 
tent with nearby red early-type galaxies and higher r values 
are consistent with bluer galaxies with higher observed star 
formation rates). As one would expect, the optically fainter 
galaxies generally appear to be at higher redshift than the 
optically brighter galaxies. Their distribution in color-color 
space is more dispersed. This is probably due to optically 
faint sources spanning a larger range of redshifts. At fainter 
optical magnitudes, the uncertainties in the magnitudes be- 
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Fig. 10.— Color-color diagrams for all optically matched X-ray sources with an /?-band detection {R <23.0). The sample is split into optically bright 
{R <21.5; top), and optically fainter (21.5< R <23.0; bottom) sources. We divide the sample into optically extended sources (stellarity < 0.7; filled circles; 
left) and optical point sources (stellarity > 0.7; stars; right). The symbols are scaled by the logarithm of their X-ray flux (larger symbols denote brighter X-ray 
sources). Overplotted are the color-color tracks for PEGASE2 model galaxies (left) and the SDSS composite AGN spectrum (right). The arrow denotes the 
direction of reddening expected if there is additional inti'insic reddening of the sources due to dust. 


come larger and SExtractor starts failing in correctly separat¬ 
ing galaxies from point sources and this may also contribute 
to the larger dispersion in color-color space. 

In Figure d we show that the majority of the optically 
bright (17.0< R <23.0) point sources fall in a diffe rent re¬ 
gion o f color-color space than that predicted for stars (iPicklesI 
Il998h . Comparison to the color-color tracks of a model quasar 
SED as a function of redshift suggests that the majority of op¬ 
tically bright point sources are quasars at redshifts z=0-3. At 
z > 3, the SED becomes strongly attenuated in the observed 
band; the color-color track will continue to rise to very 
red Bw-R colors. There are relatively few sources occupying 
the color-color space in which one would expect quasars at 
z > 3 to lie. This is not surprising given the shallow nature 
of our X-ray survey compared to the deep Chandra pointings 
and that we are only considering R <23 sources. 

Several effects (dust reddening, redshift, relative fractions 
of galaxy and AGN light, contamination of broad-band col¬ 


ors by strong line emission, mis-classification, etc.) are un¬ 
doubtedly responsible for the observed scatter in the optical 
colors of the X-ray sources. Nevertheless, we note the exis¬ 
tence of a significant population of red sources in this sample, 
with some suggestion that the median color of the optically 
fainter sources is redder than that of the optically brighter 
sources. For the extended sources, this reddening is prob¬ 
ably largely due to the galaxy redshift increasing to fainter 
magnitudes. For the point sources, however, dust reddening 
is likely to be the cause of the redder median color. The most 
reddened sources may not be present in the optically selected 
quasar surveys such as the SDSS from which w e have used the 
compo site SEP to calculate our m odel tracks. iRichards et al.l 
ll2003l) and iHopkins et al.l ll2004l) find that the reddening of 
quasars is dominated by SMC-like dust at the quasar redshift. 
We use th e parametric extinction laws within the SMC of 
IpHHH to determine how this reddening would affect the 
position of quasars in color-color space. Indeed, it appears 
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that there may be an increasingly large fraction of reddened 
quasars at fainter optical fluxes. A visual examination of the 
small number of sources with extreme colors {Bw-R > 3) 
reveals that they are genuine sources with very faint Bw emis¬ 
sion rather than sources with problems in their optical pho¬ 
tometry or a significant contribution from stellar light. This 
suggests that their UV radiation is strongly attenuated either 
because they are at high redshift or because they have signifi¬ 
cant amounts of dust which preferentially absorbs the light at 
low wavelengths. 

5.3. The X-ray to Optical Flux Ratio of the matched sample 

Traditional optically selected samples of AGN may only 
detect a fraction of the overall population because in many 
sources dust obscures the AGN core from view at optical 
wavelengths. We can study the properties of this optically ob¬ 
scured population based on samples of X-ray selected AGN 
with a high X-ray to optical flux ratio, defined as: 

log/x//o = log(/.t)-i-^-9.53, (2) 

dMcHardv et al.l20()^ where is the total (0.5-7 keV) X-ray 
flux in units of 10“^^ergs cm“^ s“' and R is the apparent R- 
band magnitude. In Figure^J we show the X-ray flux against 
f?-band magnitude. Overplotted is the region 0-1 < fx/fo < 
10. St ar-forming galaxies tend to fall in the region frJ fn< 0.1 
(e.g., iGiacconi et alJ 1201)11 iHornschemeier et alil200l() and 
optically un-obscured AGN tend to populate the shaded re¬ 
gion (0.1 < fr /fn <10) (e.g. , iBarger et'5]l2003l ISteffen et alJ 
120041 l^lverrnan et alJl20()4l) . The fx/fo >10 sources are ex¬ 

pected to be the optically obscured AGN population (although 
high redshift X-ray sources may also hav e lar ge fx/fo)- The 
optically non-identifled sources (Section l5.4> fit within this 
category. 

We calculate the median /?-band magnitude for different X- 
ray flux bins (using logarithmic binning). We And that the me¬ 
dian value for the optical point sources lies along the f^ = fo 
line (see Figure CD. The median value for the optically ex¬ 
tended sources lies further above the f^ = fo line, but the scat¬ 
ter is large enough that the median value is still consistent with 
fx = fo- In Figure E] we illustrate how the extended sources 
have a larger mean X-ray to optical fl ux ratio than the point 
sources. A KS-test JPress et alJll992^ shows that the X-ray 
to optical flux ratio distributions of the two populations are 
drawn from a different underlying distribution at a >99.9% 
significance. This is probably due to the fact that for the ex¬ 
tended sources, the optical emission is from the stellar pop¬ 
ulation (i.e., the host galaxy light dominates either because 
it is optically brighter or because the central AGN is com¬ 
pletely optically obscured) whereas in the case of the point 
sources, the optical emission is dominated by an optically less 
obscured AGN. One must be careful because the optical clas¬ 
sification may be unreliable at R >23 (see Figure |D and this 
may affect the classification of sources with fx/fo >5. How¬ 
ever, our conclusions still hold when considering only R <23 
sources (albeit with larger uncertainties). 

We define the 773 sources W\\hfx/fo >10 as our candidates 
for an optically obscured AGN population (we have omitted 
the 26 sources that are not truly optically blank; see Sec- 
tion l5.4t . The population consists of 510 sources that are de¬ 
tected in 1?, 217 that are only detected in another optical band, 
and 47 that are optically non-detected in all optical bands to 
the limits of our survey. All of the non-detected sources have 


fx/fo >10. The majority of the fx/fo >10 sources (643/773) 
have extended optical emission (stellarity <0.7), suggesting 
that the optical light is dominated by the stars in the host 
galaxy (i.e. the optical emission from the AGN is heavily 
obscured). This is also true for the R <23 sources {YUM 
are optically extended), suggesting that the unreliability of the 
stellarity parameter at fainter optical magnitudes is not bias¬ 
ing this result. We note that for the sources with low X-ray 
counts, the X-ray flux may have been over-estimated due to 
Eddington bias. This effect is discussed in Renter et al. (2005) 
and may mean that fx/fo < 10 for a fraction of our optically 
obscured AGN sample. 

The X-ray hardness ratio is a useful quantity to approximate 
the spectral shape of the X-ray emission in cases for which the 
counting statistics are insufficient to determine the shape of 
the X-ray spectrum and the redshift distribution is unknown. 
The X-ray hardness ratio is defined as: 


HR = 


(Ch - Cs) 

(Q + C,)’ 


(3) 


where Ch and Cs are the counts in the hard (2-7 keV) and soft 
(0.5-2 keV) bands respectively (Renter at al. 2005). In gen¬ 
eral, the intrinsic X-ray spectral slope is not thought to vary 
significantly from source to source. We therefore assume that 
changes in the hardness ratio can be attributed to a variation in 
the absorption column of gas which preferentially attenuates 
the soft X-rays. The hardness ratio spans the range -1 (only 
soft band counts) to 1 (only hard band counts: appropriate for 
obscured AGN in which the soft photons are absorbed by an 
intervening column of gas). 

In Figure CD we show the distribution of X-ray hardness 
ratio for the optically obscured AGN sample (fx/fo >10) 
compared to that of the (renormalized) main AGN sample 
(0.1< fx/fo <10). The optically obscured AGN sample ex¬ 
hibits a slightly broader distribution of X-ray hardness ratios 
whereas the Q.\< fx/fo <10 sample is more dominated by 
soft X-ray sources. A RS test rules out the hypothesis that 
the two hardness ratio distributions are drawn from the same 
underlying distributions at a probability of >99.9%. This sug¬ 
gests that AGN that are obscured by dust in the optical are also 
more likely to be obscured in the soft X-ray by gas. How¬ 
ever, there is clearly no direct correspondence between these 
quantities: many of the optically obscured X-ray sources ex¬ 
hibit hardness ratios consistent with them being unobscured 
in the X-ray and conversely, many optically unobscured X- 
ray sources exhibit hardness ratios consistent with them be¬ 
ing obscured in the X-ray. This is a si milar result to tha t 
found for a smaller number of sources bv lWang et alJ J2004I) . 
The sources with fx/fo >10 are generally optically fainter 
(by selection) and may be at a higher redshift than those with 
0.1< /r//o <10. If this is the case, the negative k-correction 
in the X-ray will result in similarly obscured sources exhibit¬ 
ing softer X-ray emission, and this may weaken the correla¬ 
tion between sources which appear X-ray obscured and opti¬ 
cally obscured. 


5.4. Sources with no Optical Counterparts 

Of the 3,213 X-ray sources, 73 have the greatest probabil¬ 
ity of having no cataloged optical counterpart in any of the 
single-band catalogs (Bw,R,I,K) in the NDWFS. This high 
completeness is due to the fact that we will correctly opti¬ 
cally identify most of our X-ray sources because the XBootes 
survey is relatively shallo w in th e X-ray but deep in the op¬ 
tical. We show in Section 14.1.31 that although the complete- 
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Fig. 11.— The total (0.5-7 keV) X-ray flux against 7?-band magnitude for all extended sources (stellaiity < 0.7; circles; left) and point sources (stellarity > 
0.7; stars; right). Note that the small amount of vertical structure at low X-ray fluxes is due to quantized source counts. We only plot sources with R >17.0 as 
saturation effects may result in inaccurate optical magnitudes in the brightest cases. Sources with R <23 will have less reliable stellarity values. Arrows denote 
objects with no 7?-band detection. In these cases we plot the 7?-band limit for the appropriate optical sub-field. Small arrows denote objects with a detection in 
another optical band. Large arrows denote optically non-detected sources. Because there is no information on their stellarity, these sources are plotted on both 
figures. The (yellow) shaded region denotes the region in which 0.1< fx/fo <10; our fx/fo >10 sample lies above this region. The large crosses show the 
median R magnitude and standard deviation around the mean for different X-ray flux bins. 





Fig. 12.— Histogram of the number of optically extended sources (ob¬ 
scured AGN; dotted line) and optical point sources (unobscured AGN; solid 
line) as a function of the X-ray to optical flux ratio for all X-ray sources with 
a detection in the /?-band. 

ness of our matched catalog is ~98%, the true completeness 
is likely to be only ~95%. This is because the number den¬ 
sity of faint optical sources is high and so we will have mis- 
identified some true optically unidentified sources as matches 
due to chance alignments. We therefore note that our sam¬ 
ple of sources with no cataloged optical counterparts is only 
^40% complete and so further optically blank sources exist 
but are not identified (but this is an unavoidable consequence 
in any matching technique of such catalogs). 

We directly examined the positions of the X-ray sources 
in the NDWFS images to determine whether these remaining 
sources truly have no optical counterparts brighter than our 
survey limits. We divided the sources into those that were 
near an optically bright star or galaxy (i.e., in regions where 



X-ray Hardness Ratio 


Fig. 13.— Histogram showing the number of sources as a function of 
hardness ratio for all X-ray sources with fx/fo >10 (773 sources; hashed), 
0.1< fx/fo <10 (2280; blue shaded), and the optically unidentified sources 
(47; black solid). The optically unidentified sources all have fx/fo >10. 

the optical imaging or catalog data are compromised or the 
X-ray emission may come from a ULX in the outskirts of the 
galaxy) and those that appear to be truly optically blank to the 
NDWFS survey limits. Ten sources lie close to a bright star or 
galaxy, but the star or galaxy was not a plausible identification 
because of the accuracy of the X-ray and optical positions. 
Because the X-ray position is offset from the optical position 
of the star or galaxy, the true optical counterpart is probably 
obscured by the bright star or galaxy, which happens to be 
near our line of sight to the X-ray source. Throughout our 
analysis, we have assumed that the intrinsic positions of the 
X-ray and optical emission are identical. However, in some 
cases, the X-ray emission may be coming from sources off¬ 
set from the center of the galaxy. Sixteen sources are near 
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to a low-redshift, bright galaxy which is either obscuring the 
true source from view (as in the above sources), or host to the 
X-ray source. In the latter case, the X-ray source could be a 
bright high-mass X-ray binary (HMXB) which is offset from 
the center of the galaxy. Such sources have been obs erved in 
the Chandra Deep Field JHornschemeier et al.ll2004|) . How¬ 
ever, the brightest X-ray binaries found in the local Uni- 
verse have X-ray lu minosities no larger than ^ 10"^° ergs s“' 
dGrimm et al.l2003l) and so for HMXBs to be individually de¬ 
tected above the flux limit of our survey, the galaxy would 
have to be at low redshift; z=0.02 or nearer. 

For the remaining 47 of the optically unidentified X-ray 
sources, there is clearly no optical counterpart in any of the 
NDWFS bands which could be associated with the X-ray 
emission. We expect some of these sources will be spuri¬ 
ous X-ray detections. From analysis of archival ACIS-I back¬ 
ground data, we expect f«35 X-ray sources in the XBootes 
survey to be spurious (Renter et al. 2005). Our Monte Carlo 
simulations suggest that spurious sources will be matched to 
an optical source approximately 60% of the time (see Sec¬ 
tion therefore we expect only «14 (30%) of the opti¬ 

cally unidentified X-ray sources to be spurious X-ray detec¬ 
tions. 

The following properties of the optically unidentified X-ray 
sources give us further confidence that our sample is not dom¬ 
inated by spurious X-ray detections. We measured the 4" di¬ 
ameter aperture magnitude for the 47 optically non-identified 
X-ray sources and found a significant signal in the majority of 
cases; 22, 29, and 33 of these sources have > 3cr signal over 
the local background in the Bw,R, and /-band respectively. 
We expect the majority of the spurious X-ray sources to fall 
at large off-axis distances and to have X-ray counts close to 
the survey limit (Renter et al. 2005). The X-ray sources with 
no optical identification have a similar distribution in their off- 
axis distances to the X-ray sources with optical counterparts, 
suggesting that they are not dominated by spurious detections. 
In general, they have lower X-ray counts than sources with op¬ 
tical counterparts (with a median X-ray counts of 5 compared 
to 6 for the sources with optical counterparts) but this may be 
because they are at higher redshift and/or are obscured in both 
the optical and soft X-ray band. 

We conclude that these sources are most likely to be dom¬ 
inated by powerful high redshift and/or optically obscured 
quasars. They are an extr eme subset of the fx/fo >10 sources 
discussed in Section ISJl Collectively, the optically unidenti- 
hed sources have a hardness ratio of-0.08±0.005 (135 soft 
counts, 116 hard counts). A RS-test shows that, like the 
fx/fo >10 sources, the hardness ratio distribution is different 
from that of the 0.1< fx/fo <10 sources at a >99.9% proba¬ 
bility (see Figure [131 . This suggests that like the fx/fo >10 
sources, they have, on average, large column densities of gas 
obscuring the soft X-ray emission. 

6 . SUMMARY AND DISCUSSION 


We have presented the catalog of optical counterparts of the 
3,213 X-ray point sources in the Bootes held of the NOAO 
Deep Wide-Field Survey. Our Bayesian technique hnds op¬ 
tical counterparts for 98% of the X-ray sources. This pro¬ 
vides the basis for further investigation of the properties of 
the X-ray sources. In particular, a large program of optical 
spectroscopy with the Hectospec instrument on the MMT is 
underway, targeting all («1900) X-ray sources with R <21.5 
or / <21.5 optical counterparts (AGES; Rochanek et al. in 
prep.). 

Approximately half of the XBootes sources are identihed 
with optical point sources; the other half have extended op¬ 
tical counterparts dominated by extended galaxy light. The 
bright (R < 23.0) optical point sources fall in a region of op¬ 
tical color-color space consistent with them being quasars at 
z <3. The fainter point sources appear to be redder in their 
optical colors, suggesting that they may be more obscured 
QSOs. The optical colors of the extended sources suggest that 
they are a combination of z < 1 early-type galaxies and bluer 
star-forming galaxies. 

The large area. X-ray shallow and optically deep nature of 
the XBootes survey allows us to identify a large sample of 
773 bright X-ray sources with high X-ray to optical flux ra¬ 
tios {fx/fo > 10). We interpret these large X-ray to optical 
flux ratios as resulting from extinction of the optical light 
by dust. These X-ray sources are generally harder in their 
X-ray spectra than the bulk of the X-ray source population 
(0.1 < fx/fo < 10), suggesting that AGN that are obscured 
by dust in the optical are also more likely to be obscured 
in the soft X-ray by gas. Such a large sample of bright X- 
ray sources with extreme properties holds much promise for 
follow-up multiwavelength spectroscopy and detailed studies 
of obscured AGN. An extreme subset of the fx/fo > 10 pop¬ 
ulation are 47 X-ray sources which have no optical identifica¬ 
tion down to /? ~ 25.5, the average 50% completeness limit 
of the NDWFS /?-band images. 
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APPENDIX 

BAYESIAN SOURCE IDENTIEICATION 

In this Section, we outline our Bayesian method (developed by CSR.) for identifying the most probable optical counterpart 
for each X-ray source in the XBootes survey. This method provides a formal approach to assigning a probability to each optical 
source in the vicinity of the X-ray source being the true optical counterpart (as well as the probability of there being no cataloged 
optical counterpart down to the limit of the optical survey). In our implementation, the Bayesian method requires information on 
the X-ray counts and off-axis distance for each X-ray source, the angular distance of all optical sources from the X-ray source 
in the vicinity of each X-ray source, and their optical magnitudes. It then uses the data itself to self-consistently estimate each 
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individual X-ray positional error, the probability of an optical source of a given magnitude and distance being there by chance, and 
ultimately the probability of each neighboring optical source being the true optical counterpart. Although the method described 
here is applied to the specific problem of determining the most probable optical counterpart for each X-ray source in the XBootes 
survey, we have tried to keep the discussion as general as possible to allow the method to be extended to similar problems. 

We begin by estimating the likelihood of an optical source in the absence of an X-ray source - i.e., the probability of the 
background. From the overall optical held counts we know the differential number counts of sources, B = dN/dm, down to 
the limit of our optical catalog In practice, we have used the observed NDWFS counts rather than a theoretical model 

because it implicitly includes the selection effects of the survey and the catalog. We now want to compute the probability of 
hnding k = 1 • • n, sources with magnitu des nii, in a region o f area A, . Imagine subdividing the region into inhnitesimal bins of 
angular size Ax and magnitude Am (see lKenner et alJ[l999ll . Inhnitesimal bins are either empty or contain a single source with 
an expected number of sources in a bin of Na = Ax^ AmB{m). Thus, the probability of an empty bin is exp(-Ao,), the probability 
of a bin containing an object is Na exp{-Na), and the log-likelihood of the data is 

lnL= ^ (-Na)+ Y. (-Na + inNa). (Al) 

empty bins full bins 

Notice that the Na terms are summed over all bins, while the InAc terms are summed only over bins containing sources, so this 
becomes 


lnL= ^ (-A„)+ ^ (ln(A„)) (A2) 

all bins full bins 

fii 

\nL = -AiN{mii„,) + Y^^^Bk (A3) 

k=\ 

where Nirnnm) is the integrated number counts per unit area and the sum k = 1 • • •«, extends over the n, optical sources in the held. 
A constant term that depends on the arbitrary bin widths has been dropped. We imagine doing this background calculation over 
a very large region so that the hrst term is unaffected by the inclusion of a known X-ray source in the region. Thus in the absence 
of an X-ray source, the probability of the data (D,) given our background model is 


P{Di) = Ptack = exp (-AiN(mii„,)) iTAiBk (A4) 

The statistics of an X-ray source in the held are not governed by Pback- If we can identify X-ray source i and optical source k 
with probability Mik, then we should drop that source from the calculation of the background probability. The calculation will 
(briehy) appear to be very complex beca use we must i nclude all possible discrete identihcations and then select between them. 
The met hod we now outline is based on lPressHl997ft. While the initial results for the probabilities are very similar to those 
found bv ide Ruiter. Willis. & Ami lll976h or lBennl()l983h . our approach has greater formal clarity and lends itself to some useful 
extensions. 

We include all possible identihcations by introducing a binary vector v** for the case in which optical source k (k = 1 • • • n,) 
is identihed with X-ray source i, where n, is the total number of possible identihcations in the held around the X-ray source. 
Each vector is a binary code whose entries are = 0 for k and v‘^= 1 for a = k (i.e. the true identihcation). We allow for 
the possibility that none of the optical sources are the identihcation by adding the extra vector F® all of whose entries are zero. 
We also introduce a probability / for an X-ray source possessing an optical identihcation. The objective of the calculation is to 
determine the probabilities of the different possible identihcations = Pik and the mean completeness /. Since our possible 
identihcations are exhaustive, the total probability is simply the sum over all the mutually exclusive possibilities: 


P(D,■|v‘^/) = exp(-A,A(m/,,„)) 




( 1-0 


;t=l 


(A5) 


This may look overly complex, but the structure of the identihcation vectors v*^ allows us to simplify it to 


P(D,■|F^/) = n„B„exp(-AW(m/,m)) 


(1-/) + /^^'^ 
k=l 


O' 


(A6) 


P(Di\P\f)= Pback 


Hi 

(i-/)+/E 

k=l 


Bk 


(Al) 


The leading term of 1 - / is the case of no identihcation (k = 0), and the sum covers the possibility of identifying X-ray source i 
with each of the k = 1 • • • n, optical sources. Thus, the relative likelihoods of the identihcations are determined by the ratios of the 
likelihoods that an optical source is coincident with the X-ray source compared to the likelihood that it is just there by chance. 
Since we intend to use a hxed background model we can simply drop the Pback term as it will have no effect on the subsequent 
results. It should be retained if the model of the background is going to be optimized as part of the later calculations. The simplest 


















Optical IDs for the XBootes Survey 


17 


model of the probability of optical source k being associated with X-ray source i is a Gaussian model incorporating the distance 
of the optical source given the X-ray positional error, 


1 

Mik = -— 



(A8) 


where rtk is the distance of the optical source from the X-ray source and at is the uncertainty in the relative position, which can 
be modeled by 






CTq+ 0-600 


600."0 


H 2 


(A9) 


The first term, Ua, models any remaining systematic astrometry problems that do not depend on the X-ray flux. The second 
term models the position- and flux-dependent Chandra PSF, with cto giving the width at the pointing center (dk = 0) and creoo 
describing the quadratic growth of the PSF width off the axis. The accuracy with which an X-ray source is centroided improves 
as the number of counts increases. We therefore scale with the number of X-ray counts Q. We will refer to this as the “PSF 
model”. 

The derivation for multiple ID possibi lities is more elaborate and we used Cartesian rather than polar coordinates, but our 
basic expressions are identical to those ofH en The probabilities always depend on the ratio Mik/Bk, which ought to be 

dimensionless. If we use the number counts as a function of magnitude Bk (units mag“' deg“^) and the probability for the source 
position in Ean. lA8l (units deg“^), then the ratio is not in some senses dimensionless. This does not matter if we force all sources 
to have optical IDs (/ ^ 1), but the probability of finding no ID for a given source and the posterior probability distribution 
P{f\D) are affected by this problem. We could include the probability that an X-ray source of a given flux has a given optical 
magnitude, but this adds more parameters than we presently want to explore. Instead we use 


Mik = 





(AlO) 


where am is the dispersion of the optical catalog in magnitudes. This simple modification lead to better quantitative results. 

So far nothing in our calculation is very explicitly Bayesian. However, we now want to invert the probability of the data given 
the model parameters P(D|p), to obtain the probability of the parameters given the data /"(plD). Doing this inversion requires a 
prior probability P(p) for the parameters, since probability calculus says that 


F(p|D)(xP(D|p)F(p). 


(All) 


In all subsequent equations (with a oc sign) there is an implicit normalization such that 


J ^fpP(D|p)P(p)= 1, 


(A12) 


which we do not include in the equations because it is too cumbersome. It is the introduction of the priors (the /"(p)) which tends 
to generate nervousness about Bayesian calculations. In our present calculation, if the IDs depend on the priors then one has a 
much more basic, practical problem than issues of statistical principle. 

If we understand our matching model (the Mjk) perfectly, then the probability of the source IDs given the data is 

P(v'^/|D) (X P(v'V(/)m■|v-'^/). (A13) 

We assume that the prior likelihoods of all possible identihcations are equal (P(v'^) = 1). For the probability of an X-ray source 
having an optical identification {Pif)), we use both logarithmic priors and rough Gaussian priors based on the known character¬ 
istics of the Chandra point spread function (cto=0"6±0."2 and CT6oo=5 .''0± 1 "6). In practice, the use of these different priors yield 
similar results. To start with, we assume that the average probability / of an X-ray source having an identihcation is constant. 
The probability of identifying X-ray source i optical source k is then 


Pik = P(^^\D) = f 


Bk 


(i-/)+/E 


i=i 


B, 


(A14) 


while the probability of having no ID is 


P,o = P(f'“|D) = (!-/) 


(l-/) + /E 

/=! 


Bi 


(A15) 


and these probabilities can be evaluated for each individual source. Note that the probability for individual sources is not governed 
solely by the average probability - if M ,//Bi is a large number, meaning that it is extremely unlikely to find an optical source that 
closely associated with the X-ray source by chance, then Pik 1 and F,o^O independent of /. But / should really just be treated 
as another Bayesian parameter, so we should estimate our identihcation probabilities by marginalizing over / rather than using a 
hxed value. Similarly, by marginalizing over all possible identihcations, we obtain an estimate of the probability distribution for 
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/. We can also include additional, global parameters. For example, if there are uncertainties in the accuracy of the astrometry, 
then the matching probability Mu may also have some unknown or ill-constrained parameters. 

Once we include a set of global parameters, which we will call p, then we must work with the probability of matching all the 
sources rather than that of matching individual sources, 

P{^^,p\D) (X P(p)n,.F(A|v*',p), (Alb) 

where we now take the product i=l---N over all the X-ray sources and add a prior P(p) for the new parameters. The expression 
assumes a uniform prior for the identification vectors P{9^) =1.^ In order to estimate the probability of identifying optical 
counterpart / with X-ray source i, we must now marginalize the probability over all the parameters p as well as all the other X-ray 
sources and their identifications. If we define 


Pu = P{P'\D) cx Pi, = J t/pF(p)/^^^^F(D,|f'^p), 

(A17) 

for optical source 1, and 


PiO = P{P‘\D)^p,o = J t^pF(p)(l-/)^„,^•F(A■|f■^p), 

(A18) 

for the possibility of no identification, then the probability of identifying optical source / with X-ray source i is 

P,- P'' 

PiQ + l^i=iPii 

(A19) 

and the probability of having no identification is 

p PiO 

?0 

PiO + 2^l=\Pil 

(A20) 

where we have now included the normalizing denominators. If the mean probability of an identification / is the only parameter, 
the only difference from Fnns.lATdlandlATSlis that the identification probabilities for X-ray source i are weighted by the prob¬ 
ability distribution for / estimated from its prior and the data for all other sources rather than using a specified value. Similarly 
we can determine the probability distributions for the extra parameters by marginalizing over all the other parameters and all the 
source identifications to And that the probability distribution for parameter p„, is 

P{pm\D)o^ j u^p,^,„F(p)^,•F(A|v"^p). 

(A21) 


While some care must be exercised in ordering the calculation and in avoiding variable overflow problems, this algorithm is 
easily and compactly implemented. The only significant limitation is that each new parameter p requires an additional integral 
for the identification probabilities, eventually leading to the usual difficulties accompanying the numerical calculation of many¬ 
dimensional integrals. 
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